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Abstract. Let G be a finite grapii with tlie non-fet-order property (essentially, a uniform finite 
bound on the size of an induced sub-half-graph). A major result of the paper applies model-theoretic 
arguments to obtain a stronger version of Szemeredi's regularity lemma for such graphs, in which 
there are no irregular pairs, the bounds are significantly improved, and each component satisfies an 
indivisibility condition: 

Theorem 15.181 Let fe* G N and therefore fc** (a constant depending on fc*, but < 2*^*+^) be given. 
Let G be a finite graph with the non-kt -order property. Then for any e > there exists m = m(e) 
such that for all sufficiently large A C G, there is a partition {Ai : i < i{*) < m) of A into at most 
m pieces, where: 

(1) for allij < i{*), \\A,\ - \A,\\ < 1 

(2) all of the pairs {Ai,Aj) are {e, e) -uniform, 

meaning that for some truth value t — t{Ai,Aj) £ {0, 1}, for all but < e\Ai\ of the elements 
of \Ai\, for all but < (^\Aj\ of the elements of Aj , (aRb) = t(Ai, Aj) 

(3) all of the pieces Ai are e-excellent {an indivisibility condition, Definition \5.S\ below) 

(4) if^<^, t/ienm< (3 + £)(§)"" 

Motivation for this work comes from a coincidence of model-theoretic and graph-theoretic ideas. 
Namely, it was known that the "irregular pairs" in the statement of Szemeredi's regularity lemma 
cannot be eliminated, due to the counterexample of half-graphs. The results of this paper show 
in what sense this counterexample is the only essential difficulty. The proof is largely model- 
theoretic (though written to be accessible to finite combinatorialists): arbitrarily large half-graphs 
coincide with model-theoretic instability, so in their absence, structure theorems and technology from 
stability theory apply. In addition to the theorem quoted, we give several other regularity lemmas 
with different advantages, in which the indivisibility condition on the components is improved (at 
the expense of letting the number of components grow with |G|) and extend some of these results 
to the larger class of graphs without the independence property. 



1. Introduction 

This paper applies ideas from model theory to give stronger regularity lemmas for certain natural 
classes of graphs. We first state Szemeredi's celebrated regularity lemma. (The reader is also referred 
to the excellent survey |8j.) 

Recall that if A, B are finite graphs with disjoint vertex sets, the density B) = ^^^^\^^^^ 
and we say that {A,B) is e-regular if for all A' C A,B' Q B with \A'\ > e\A\, \B'\ > e\B\, we have 
that \d{A, B) - d{A' , B')\ < e. 

Theorem A. (Szemeredi's regularity lemma) For every e, m there exist N = N{e, m), m! = m{e, m) 
such that given any finite graph X , of size at least N , there is k with m < k < m' and a partition 
X = Xi U ■ ■ ■ U Xk satisfying: 

(1) \\Xi\ - \Xj\\ < 1 for allij < k 
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(2) all but at most ek"^ of the pairs {Xi,Xj) are e-regular. 

As explained in §1.8 of |8j, "Are there exceptional pairs?" it was not known for some time whether 
the ek'^ irregular pairs allowed in clause (b) were necessary. Several researchers (Lovasz, Seymour, 
Trotter, as well as Alon, Duke, LefFman, Rodl, and Yuster in [1]) then independently observed that 
the half-graph, i.e. the bipartite graph with vertex sets {oj : i < n} U {bi : i < n} (for arbitrarily 
large n) such that UiRbj iff z < j, shows that exceptional pairs are necessary. 

It is therefore natural to ask whether "half-graphs" are the main difficulty, i.e.: 

Question 1.1. Consider the class of graphs which admit a uniform finite bound on the size of an 
induced sub-half-graph. It is possible to give a stronger regularity lemma for such graphs in which 
there are no irregular pairs ? 

A major result of this paper is an affirmative answer to this question. Theorem 1 5 . 1 8 1 b elow . which 
both eliminates irregular pairs and also significantly improves the tower-of-exponential bounds of 
the Szemeredi lemma, which are necessary by work of Gowers 0]. The point of entry to this proof 
is that, as model theorists will recognize, the half-graph is an instance of the order property: 

Definition 1.2. A formula (p{xi, . . . X£;yi, . . . yr) has the order property with respect to some back- 
ground theory T if there exist, in some sufficiently saturated model M of T, elements {a\, . . .a\ : 
i < u}} and . . .bi : i < uj} such that \= ip{a\, . . . a^; 6-^, . . . 6^) if and only if i < j. 

Theories in which no formula has the order property are called stable. Such theories have been 
fundamental to model theory since the second author's work in |13j . see e.g. the "Unstable Formula 
Theorem" II. 2. 2, p. 30. Generally speaking, one contribution of such model-theoretic analysis is to 
characterize global structural properties, such as number of models, existence of indiscernible sets, 
number of types, and so on, in terms of local combinatorial properties, such as the order property 
in some formula. By compactness, a formula has the order property (with respect to a background 
theory T) if and only if it has the A;-order property for every natural number k, Definition 12. 21 below. 
Note, for instance, that in a graph with the non-/c-order property the density between sufficiently 
large e-regular pairs will be near or 1 to avoid the possible extraction of half-graphs. Connections 
between instability and regularity in the context of model-theoretic complexity were investigated in 

The arguments below give several distinct regularity lemmas, and thus a flavor of the utility of 
model-theoretic technology in analyzing regularity. For instance, in the usual proof of Szemeredi's 
lemma, the argument from mean-square density allows for the construction of a partition in which 
the interaction of different pieces is regular; but this need not be because the pieces themselves are 
necessarily atomic or uniform, in their own right. By contrast, a recurring feature of the proofs in 
this paper is the use of stability theory to construct partitions in which the pieces themselves have a 
certain inherent indivisibility; one can then obtain the generic interaction of these pieces (regularity, 
uniformity) nearly for free. Actually, with the exception of ^ we do not require the full first-order 
theory of the graph to be stable, just that the formula xRy have the non-A:-order property for some 
finite k. 

We now describe the structure of the paper. Section [2] contains basic definitions, properties, 
and notation. We then develop a series of partition theorems with different features, illustrating 
certain tradeoffs between indivisibility of the components, uniformity of their interaction, number 
of components and irregularity. Section [3] applies an essential feature of stability, the existence of 
relatively large indiscernible sequences. We first prove, in Theorem 13.51 that in a finite stable graph 
one can extract much larger indiscernible sets than would be expected from Ramsey's theorem. 
The main result of §3, Theorem 13.81 applies [331 to obtain an equitable partition of any stable graph 
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in which the number of pieces grows with the size of the graph; however, the pieces themselves 
are indiscernible, and all pairs interact in a strongly uniform way not superceded in later sections. 
In Section 13.21 we discuss some extensions of these ideas to the wider class of dependent graphs, 
defined there. Section H] takes a different approach to the partition of stable graphs, aimed towards 
addressing the tower-of-exponential bounds. Here too the size of the partition depends on the 
size of the graph, as the "indivisibility" condition remains quite strong (e-indivisible. Definition 
14. 2p . We prove two different partition results. Theorem 14.161 and Theorem 14.221 Theorem 14.161 
gives an equitable partition of a given graph A, with |^| = n, into e-indivisible pieces; there 
is a remainder of size no more than n^, and a small number of "irregular" pairs, however the 
"regular" pairs have no exceptional edges and the total number of pieces is approximately where 
c = c(e) = 1— g'=**+i— 2e^'^**"'"^, for fc** a stability constant from Definition l2.91 A more combinatorial 
approach resulting in Theorem 14.221 allows for no irregular pairs at the cost of a larger remainder. 
Section [5] contains the result mentioned in the paper's abstract. Theorem I5.19| its proof does not 
depend on earlier sections. Theorem 15.191 is a stronger version of Szemeredi regularity for stable 
graphs: the result gives an equitable partition of any sufficiently large stable graph into a small 
number m of pieces (m = m{e), and m < (3 + e) (|) ** for e sufficiently small), such that each 
of the pieces satisfies an indivisibility condition (Definition 15. 2p and all of the pairs are e-uniform 
(Claim [5T6|) . a stronger condition than e-regularity applicable as the densities are all near or 1. To 
conclude. Corollary 15.191 gives a slightly weaker statement of Theorem 15.191 using the terminology 
of regularity. 

Sections [3ll5] can be read independently. The authors are working on improving the bounds in 13.21 
and 13.81 and in 14.111 and on the parallel to ^for A;*-dependent graphs (necessarily with exceptional 
pairs). 

2. Preliminaries 

Notation 2.1. (Graphs) We consider graphs model-theoretically, that is, as structures G in a 
language with equality and a symmetric irreflexive binary relation R, whose domain consists of a 
set of vertices, and where the interpretation Rq consists of all pairs of vertices (a, b) connected by 
an edge. We will often write aRb to indicate that (a, b) € Rq, and write G for the domain of G. In 
particular, \G\, the cardinality ofG, is the number of vertices. 

Definition 2.2. (The non-/c-order property) A graph G has the non-fc-order property when there 
are no ai,bi G G for i < k such that i < j < k =^ (aiRcbj) A -^{ajRcbi). If such a configuration 
does exist, G has the A;-order property. 

Remark 2.3. By the symmetry of R, it is enough to rule out the order in one direction (i.e. the 
non-k-order propery also implies that for no such sequence does i < j ^{aiRcbj) A (ajRcbi). 

Claim 2.4. Suppose G is a graph with the non-k-order property. Then for any finite A C G, 
\{{a e A : aRcb} : b e G}\ < more precisely < ^i<k{^f)- 

Proof See [13] Theorem 11.4.10(4) p. 72 and Theorem 1.7(2) p. 657. □ 

Definition 2.5. (Indiscernibility) Let M be a model, let T be a set of formulas in the language of M 
and a an ordinal. Recall that a sequence {ai : i < a) of elements of M is said to be a F-indiscernible 
sequence if for any n < lo, any formula 7 = 7(2:0, . . . Xn-i) € T and any two increasing sequences 
io < ■■■ < in-i, jo < ■■■ < jn-i from a, we have that M \= 7(0^0, . . . ai„_^) iff M \= 'y{ajo, . . . aj^_^). 

Notation 2.6. Let ip be a formula. Then we identify ip^ = -^ip, ip^ = p. We also identify "true" 
with 1 and "false" with 0, so that in particular the intended interpretation of cp^ , where X is an 



4 



M. MALLIARIS AND S. SHELAH 



expression which evaluates to either true or false, is simply ip or -199, as appropriate. Likewise, the 
intended interpretation of expressions like "xRa = t", where t € {0,1} or is an expression which 
evaluates to true or false, is "xRa if and only if t = 1," or equivalently, iff t is true. 

Definition 2.7. (The set A^) Let he the set of formulas {xqRxi} U {'■p\j^ : m < k,i ^ {1,2}} 

where 

^l,m = Vlmi^o, . . . , Xfc-i) = (3y) I /\ {x.Ryyf ^'='^ A /\ (^=2) | 

\£<m m<e<k J 

Observation 2.8. Let H be a finite graph, and let A = (oi : i < a) be a Ak-indiscernible sequence 
of elements of H where a > 2k. Suppose that for some increasing sequence of indices io < ■ ■ ■ < 
i2k-i < ct and for some element b (z H the following holds: 

• for all I such that {)<(.< k — \, bRnaii and 

• for all I such that k < I < 2k, -^bRfjai^. 

Then H has the k-order property. 

Proof. For each m with < m < A;— 1, consider the sequence {cj : < j < k—1) given by cj := Om+i- 
Then b witnesses that ^{cq, . . . Cfc_i) is true in H. As any two increasing subsequences of A of 
length k satisfy the same Afc-formulas, this easily gives the fc-order property. 

Note that if we had assumed the inverse, i.e. for all i such that < i < k — 1, ^bRnOi^ and for 
all i such that k < £ < 2k, bRnai^, we again get the fc-order property (the same proof works with 
(/j^ replacing 99^ , in the notation of Definition 12.70 . □ 

Below, we will consider graphs with the non-A;*-order property (and reserve the symbol /c* for 
this bound). We define an associated bound A;** on tree height: 

Definition 2.9. (The tree bound k^^^) Suppose G does not have the k.^- order property. Let fc** < uj 
he minimal so that there do not exist sequences a = (a^ : r] G ^**2) and b = {bp : p ^ fc**>2^ of 
elements of G such that if p^{£) ^ rj ^ ^**2 then [a^^Rbp) = {£ = 1). 

Remark 2.10. In general, given a formula ip{x;y), (i) if ip has the non-k^^^-order property, then 
A:** exists and A;*^, < 2^^*+^ — 2. Conversely (ii) if k^^ is as in Definition \2.9l then cp has the 
non-2^**^^ -order property. See Hodges [B] Lemma 6.7.9 p. 313. 

It will also be useful to speak about the average interaction of sets. 

Definition 2.11. (Truth values t) By a truth value t = t[X,Y) for X,Y C G, we mean an 
element 0/ {0, 1}, where these are identified with "false" and "true" respectively. When X = {x}, 
write t = t{x,Y). The criteria for assigning this value will be given below. 

Definition 2.12. (Equitable partitions) We will call a partition of A C G into disjoint pieces 
{Ai : i < m) equitable if for all i < j < m, \\Ai \ — \ Aj\\ < 1. 

Notation 2.13. (Distinguished symbols) Throughout this article, e,^,^ are real numbers in (0, 1). 
We use p,rj for zero-one valued sequences "2, usually in the context of trees. (Following logical 
convention, a given natural number n is often identified with {0, . . .n — 1} .) The letters x, y, z are 
variables, and i,j,k,£,m,n denote natural numbers, with the occasional exception of the standard 
logical notation £{x), i.e. the length of the tuple x. T is a first-order theory, unless otherwise 
specified the theory of the graph G under consideration in the language (=vocabulary) with equality 
and a binary relation symbol R. 
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The symbols k^,, k^^, m^, m** are distinguished. When relevant (the conventions are given at the 
beginning of each section), k^ is such that the graph G under consideration has the non-k^,-order 
property, Definition \2.2[ and k^,^ is the associated tree bound, Definition \2.9l (The one exception is 
in which k^ is such that the graph under consideration is k^- dependent.) The relevant sections 
all compute bounds based on k^, so it is useful, but not necessary, to assume /c* is minimal for 
this property. Likewise, various arguments in the paper involve construction of a rapidly decreasing 
sequence {mi : i < A;*^,) of natural numbers. In context, we use and m** to refer to the first and 
last elements of the relevant sequence, i.e., tuq and mk^^_i respectively. 

G is a large graph, usually finite; A,B,X,Y... are finite subgraphs of the ambient G. Alternately, 
one could let G be infinite, while restricting consideration to its finite subgraphs. 

3. A PARTITION INTO INDISCERNIBLE PIECES 

Classically, the hypothesis of stability implies that in infinite models, one can extract large 
indiscernible sequences (Definition 12.51 above). More precisely, given A an infinite cardinal, M a 
model whose theory is stable in A and A,I<^M with |^| < A < |/|, there is J C I, |J| > A such 
that J is an ^-indiscernible sequence (in fact, an A-indiscernible set); see [13] Theorem 2.8 p. 14. 

In this section, we begin by proving a finite analogue of this result. Theorem 13. 5^ which shows 
that a finite stable graph will have relatively large indiscernible subsequences (in fact, subsets) 
compared to what one could expect from Ramsey's theorem. We apply this to give an equitable 
partition of any stable graph in which the number of pieces is much larger than the size of those 
pieces; the gain, however, is that the pieces in the partition are themselves indiscernible sets, there 
are no irregular pairs, and the condition of "regularity" is very strong. Namely, to each pair of 
pieces {A, B) we may associate a truth value tA,B such that there are at most a constant number of 
exceptional edges {aRb ^ t^^^). This is not superceded in later sections. Moreover we can extend 
some results to unstable dependent theories T, see §3.21 

Hypothesis 3.1. Throughout 321 G is a finite graph with edge relation R which has the non-k-t-order 
property. 

The next claim will be applied to prove Crucial Observation 13.61 below. 

Claim 3.2. If m > 4A;* and {oi : i < m) is a A^^ -indiscernible sequence in G, and b & G, then 
either \{i : aiRb}\ < 2k^ or \{i : -i(aji?6)}| < 2fc*. 

Proof. Suppose for a contradiction that both Y = {i : OiRb} and X = {i : -i(aji?6)} have at least 
2/c* elements. Let ii be the A;=i,th element of X and let Z2 be the A;*th element of Y. Clearly ii 7^ Z2- 
Case 1: ii < i2. By assumption, we can find a subsequence Uj-^ < ■ ■ ■ < aj^,^ < Oj^^^j^^ < ■ ■ ■ < 
«j2fc* — '^"1 such that {ji < • • • < j'fc* = «i} ^ X and {^2 = jk+i < ■ ■ ■ < J2fc, } C Y. Observation 12.81 
gives the /c*-order property, contradiction. 

Case 2: i2 < ii- Similar argument, replacing R by ^R (since R is symmetric, it is equivalent). □ 

Definition 3.3. (the notation is from [7]) Let T be a set of formulas, ni a cardinal and n2 an 
ordinal (for our purposes these will both be finite). Then ni — t- (n2)T,r,i means: for every sequence 
{ui : i < ni) of elements of G, there is a non-constant sub-sequence {oi- : j < which is a 
V -indiscernible sequence, Definition \2.5[ Replacing 1 by i means that the tuples Oj in the sequence 
have length I. Usually we suppress mention of T = Th{G) and assume l{ai) = 1, and therefore 
simply write ni — )■ (n2)r- 
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Claim 3.4. If rii =^ (^2)'^rA i the usual arrow notation then 

ni ("'2)Afc, 

Proof. Given an increasing sequence of elements of ni of length /c* we may color it according to 
which subset of the formulas of A^^ hold on the sequence, and so extract a homogeneous subsequence 
of order type n2. □ 

As explained in this section's introduction, the advantage of the next theorem is not in showing 
the existence of indiscernible subsequences, which could be obtained by Ramsey's theorem since A 
is finite, but rather in showing that in our context they are much larger than expected: a priori, in 
Claim [331 the minimal ni is essentially Dfc^(n2 + 2l^'), compared to (2) in the theorem below. It is 
possible that versions of this result exist (for infinitary versions see [H]). The statement and proof 
make use of some model-theoretic notions not needed elsewhere in the paper, i.e. types (consistent 
sets of formulas in the given free variables with parameters from a specified set) and R-rank (used 
in the "by definition" clause in Step 2A of the proof; see [13] p. 21, p. 31). 

Theorem 3.5. Assume that k,k2,A are such that: 

(a) A is a finite set of formulas, each with < k free variables, and closed under cycling the 
variables 

(b) For each formula (p{xo, . . . x^^i) € A and any partition {xq, . . . x^}, {x^+i, . . . x^-i} of the 
free variables of(p into object and parameter variables, the formula (p{xQ, . . . x^; X£^i, . . . x^-i) 
has the non-k2-order property. 

Then: 

(1) There exists a natural number r such that for any A d G, \A\ > 2, we have that \S/:^{A)\ < 

(2) For each A = {ai : i < n) there exists u C. n such that: 

• \u\ > n*-2+r^ • I t (2+r)''' j ^ where r is from (1) of the theorem, t is a stability constant 



{the R-rank of A) and k is the number of free variables. 
• {ui : i (z u) is A-indiscernible. 

In particular, ni —?■ {n2)TAk .1 S^^ '^^V '^i ^ {cn2)^'^^^'^ * , for the constant c = t (2+'-)'=* ^ depending 
only on Ak^ , as was just described. 

Proof (1) See [l3] Theorem 11.4.10(4) and 11.4.11(4) p. 74. 

(2) Adding dummy variables if necessary, we may suppose that each G A has the free variables 
Xq, ■ ■ ■ Xk-i- (We may then have to omit k elements at the end.) 

We prove by induction on m < that there is ^ n such that: 



(I) |iim,+i| > yt^j ' '^here r is from clause (1) of the theorem and t is a constant defined 
below 

(II) if zo < • • • < Jo < • • • < J'fc-i are from u.^, < k - m =^ ii = je), and if e A, 
then 

^ f{ai^,,. . . a,j^„ J = ifiojo, . . . aj^_J 
The case m = 0. Trivial: u = n. 

The case m+1. Let Um be given, and suppose \um\ = ^m- Let A™ = {^{xq, . . . Xm-i,ae^-m, ■ ■ ■ Q^m-i)}- 
This case will be broken up into several steps. 
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Step 0: Arranging the elements of Um into a tree W^. By induction on £ < £m choose sets W( C 
Um \ Uj<£ ^ t'^^e order <£ on W<i = Uj<^ such that 

• iii <e j then Oj, realize the same A"*-type over {at : i <^ i} 

• if <£ j) and -'{j <£ i) then aj,ai realize different A"*-types over {a^ : s <£ i, s <i j} 

Call the resulting tree W*, and its order <*:= <£ (so W* is u„t with the tree order <*). 
Step 1: Choosing a branch through VF* suffices, i.e. (II) of the induction. 

Let «rn+i be a branch through of maximal length (i.e. any maximal subset linearly orderd by 
<*). In this step, we verify that a branch indeed satisfies the inductive hypothesis on indiscernibility; 
Step 2 will deal with the hypotheses on size. The key is that in every branch, the type does not 
depend on the last element. 

More precisely, suppose io < • • • < ik-\-> jo < " < Jfc-i are from Um+i, Aei^ < k — m — 1 =^ 
H = je): and if G A. (As we had built the tree W^, by induction, <^ implies < in the sense of the 
order of the original sequence.) Without loss of generality, suppose im-i < jm-i- Then, recalling 
the parameters used in the definition of A"*, 

by inductive hypothesis, since the first m indices agree. 

f{aio ,■■ Oi^-i , aem-m, ■ ■ ■ a^^-i) ^{o.io > • • • «im-2 > ajm-i ' a^m-m, ■ ■ ■ o-^^-i) 

since by construction aj^_^, aj^_^ realize the same A"*-type over ai^, . . . ai^_r^ (again, recall the 
parameters used), and finally 

by inductive hypothesis. We have verified that 

¥'(oio>---«ifc-i) ^ <^(«jo'---ajfc-i) 

based only on the assumption that the first m — 1 indices coincide, which completes the inductive 
step. Having established that a branch through the tree VF* will give condition (II) for the inductive 
step, we turn to computing a lower bound on the size of a branch. 

Step 2: Lower bounds on the length of a branch through VF*, i.e. (I) of the induction. As we have 
established that any branch through would suffice for the inductive hypothesis (II), we now 
establish a lower bound on the length of some branch. Informally, we will call any tree meeting 
the specifications of W^^ from Step "a VF*-tree" . Suppose we build the VF*-tree to be as short and 
wide as possible, given the constraints of construction. In the calculation below, we find a number 
h which is relatively large as a fraction of Um (i-e. Condition (I) of the induction), and such that 
any maximally branching T^*-trce, and therefore any W*-tree, will have a branch of size at least h. 
(It is not claimed that h is optimal, but as will be seen from the construction, it appears to be a 
reasonable approximation.) 

This step will be split up into five parts. 

Step 2A: Partitioning the nodes of the tree using stability rank. Let t = R{x = a;. A™, 2) where R 
is the stability rank; then by definition of this rank, wc cannot embed ^~^^2 in W^,. For s < t let 
Sg = {i (z : above i in the tree we can embed *2 but no more}. VF* will be the disjoint union 

of {Sg : s < t}, and if ii G 5i A i2 G 'S'2 A ii <* ^2 then si > S2. 

Step 2B: Conditions for making as short as possible. Let ht{i) be the height of node i in the 
tree. For £ < h, s < t let = {i : i € Ss,h.t{i) = £}. Then the shortest tree is attained when 
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branching is maximal, i.e. 

By definition of S^, (a) if z G Sg, ht(z) = I then in the tree above z, for each I' > I there is at most 

one j ^ Sg . So to minimize height, we assume (b) if i G Ss, ht(i) = £, then there is one immediate 
successor of i in Sg, and all other immediate successors of i are in Sg-i- For s < t, let 

Us = max{z : i < h,i €z S^^i, \Xl \ = z, for X'g the set of immediate successors of i in Sg} 

i.e. the largest number of "new" elements of Ss which appear immediately following a node in 
Ss+i- (Note that at any given height i, the number of "maintenance" nodes in Ss whose immediate 
predecessor was also in Sg is at most ISg^^] by (a).) We will compute bounds on these constants Qg 
in Step 2D after discussing a key inequality in 2C. 

Step 2C: An expression for the number of nodes. Let h be the height of some maximal branching 
VF*-tree. Continuing the notation of Steps 2A-B, notice that if 

i2il\s'r\<\^^\ 

t=G s=l 

then any maximal branching PF*-tree of height h will not exhaust the elements of Um as nodes, 
since the lefthand side of the expression gives the number of nodes in the tree. In other words, any 
maximal branching W^f-ticc, and thus any VF*-tree, must have a branch of length at least h+1. 

Step 2D: Computing an inequality. In this step, we prove that: 

Y^j2\s\-^\<h^-t.h^ 

First, using (a), (b) from Step 2B: 

(.=G s=l s<t 

We can bound the values ag from 2B by noting that Part (1) of the theorem gives a bound r such 
that there are no more than \AY distinct types over any given set A, \A\ > 1. Since we are counting 
siblings in the tree of a given height i + 1, the A in question is the set of common predecessors, 
whose size is just the height £. That is, for any given s, Ug < and so: 

i2j2\si^\<h^.t.hr 



e=o s=i 

as desired. 



Step 2E: Concluding that (II) holds. Combining 2D with the analysis of Step 2C, when h satisfies 

h t 

^^\S\-'\ <h'^-t-h' < \u^\ 
e=o s=i 

then any maximal branching VF*-tree, and thus any VF*-tree, must have a branch of length at least 
h + 1. In particular, this inequality will hold when 



h < — 
t 



1 

2+r 
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Thus we conclude that W^, has a branch Um+i of length > ^^^^ ^"'"'^ ■ 

This completes the inductive step. 

Thus in k steps we extract a sequence of indices u for an indiscernible sequence; the size of u will 

be at least n*-"^ • (2+r)fc ^ ^ where r is from (1) of the theorem, t is a stability constant (the 

i?-rank of A) and k is the number of free variables. 

This completes the proof of the theorem. □ 

We now return to building a regularity lemma. From Claim [3?2] we know how individual elements 
interact with indiscernible sequences. The next observation shows a uniformity to the individual 
decisions made by elements in an indiscernible sequence. 

Observation 3.6. (Crucial Observation) Suppose that A = {oi : i < si), B = {bj : j < S2) are 

Ak, -indiscernible sequences. Suppose that si > 2/;;* and S2 > (2A;*)^. 

Let U = {i<si: < s2){ajRbi)}. Then either < 2k^ or \U\ > si - 2k^. 

Proof. Suppose the conclusion fails. Let ii be the fc*th member of U., and let 12 be the /c*th member 
of {0, . . . si - 1} \ U. Clearly ii / is- 

Case 1: ii < 12- Choose elements jo < • • • < jk,-i from U and elements jk* < • • • < jk,+k,-i < ■^i 
from {0, . . . si — 1}\U satisfying j^^-i < < «2 ^ Jfc, ■ Recall by Claim [3^2] that each aj^ partitions 
B into a small and large set; for each i < 2A;*, let the "small set" be 

Wg = {i<S2: aj^Rbi ^ (^(3-^^*^ < S2)-(aj,fl6i)) } 
By Claim [3?2] and the definition oiU, each < 2A;*. Thus 



e<2k* 



< {2k^f < \B 



Choose n G {0, . . . S2 — 1} \ U£<2fc Then for all i such that < £ < /c* — 1, bnRai and for all I 
such that k^ < £ < 2k^, -^bnRat. By Observation 12.81 G has the A;*-order property, contradiction. 
Case 2: i2 < ii- Similar, interchanging R and ^R. □ 

Conclusion 3.7. Recall the hypotheses of this section: G is a finite graph with the non-k^^-order 
property. 

If (A) then (B). 

(A) (1) m ^ {n2)TAk„i 

(2) n > nin2 and n2 > (2A;*)^ 

(B) if A Q G, \A\ = n, then we can find A, mi,m2 such that: 

(a) A = {Ai -.i < mi) 

(b) A is a partition of A 

(c) n = n2'mi + m2, ra2 < ni < mi 

(d) For each i, \Ai\ G {n2,n2 + 1} 

(e) Each Ai is either a complete graph or an empty graph (after possibly omitting one 
element) 
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(f) If i ^ j < mi, then (after possibly omitting one element of Ai and/or Aj), for some 
truth value t{AiAj) € {0, 1}, we have that for all but < 2k^ a € Ai, for all but < 2k^ 
b e Aj, aRb = t{Ai,Aj). 

Proof. First, choose mi satisfying n2mi < n < n2mi + ni (so mi > ni by (A)(2)), and let <* be a 
linear order on A. Second, we choose A'^ by induction on £ < mi to satisfy: 

. A',<ZA\{A'^:j<£} 

• \A'f\ = n2 

• if we list the elements of A in <^,-increasing order as (a^^j : i < 712), this is a A^, -indiscernible 
sequence. 

The existence of such A'^ is guaranteed by the hypothesis (A)(1). Since Aj^,^ includes {xRy}, we 
will have (e) by the symmetry of R. 

Third, let {a* : i < 7712) list the remaining elements, i.e. those of A \ IJ{^^ • ^ < n^i}- Let 
Ai := A'^U {a|} if is well defined and Ai := A'^ otherwise. Condition (c) ensures there is enough 
room. This takes care of (a)-(e). 

In Condition (f), we may want to delete the extra vertex added in the previous paragraph. Then 
the Crucial Observation 13.61 applied to any pair {Ai,Aj) gives our condition, i.e. it shows that if we 
choose an element a € Ai (provided we did not choose one of the at most 2A;* exceptional points) 
and then subsequently choose an element b € Aj (all but at most 2k^ of them are good choices) we 
find that a, b will relate in the expected way. This completes the proof. □ 

Theorem 3.8. Let A;*, 71,2 be given with n2 > {2k^:)^. Then there is N = N{n2,k^:) such that any 
finite graph G, \G\ > N with the non-k*- order property admits a partition G = {Gi) into disjoint 
pieces Gi which satisfies: 

(1) for each Gi G G, \Gi\ G {n2,n2 + 1} 

(2) (after possibly omitting one element) each Gi is either a complete graph or an empty graph 

(3) for all pairs Gi,Gj € G (after possibly omitting one element from each) there exists a truth 
value t{Gi,Gj) E {0,1} such that for all but < 2A;* a € Gi, for all but < 2A;* b G Gj, 
aRb = t{Gi,Gj). 

Moreover, N = nin2 suffices for any ni > {cn2)^'^~^^^*'* , as computed in Theorem 13.51 in the case 



where A = A^^ (in that calculation c = uuas a constant depending only on A, i.e. A^^). 

Proof. By Conclusion 13.71 and Theorem 13.51 □ 



Remark 3.9. (1) Note that clause (f) of Conclusion 3.7 is stronger than the condition of e- 
regularity in the following senses. 

• It is clearly hereditary for Cj C Aj, |Cj| > |2A;*p. 

• The density of exception is small: 

|{(a,6) G X Aj : {aRb) = -'tij}| ^ 2k^ ^ 2k^ 

I -^i 1 1 -^j I I -^i I I -^j I 

• // |yli|, \Aj\ are not too small, tij = tj^,. 

• // we weaken the condition that [j-Ai = A to the condition that \A\[J-Ai\ < m2, we 
can omit the exceptional points. It may be better to have \Ai\ G {772, 1} with \{i : \Ai\ = 
1} < ni. 

(2) As for the hypotheses(A)(l)-(2) of the theorem: although Theorem 1 3. 51 will not apply outside 
the stable case, some extensions to the wider class of dependent theories are discussed in 
SectionlKM e.g. ClaimlME 
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3.1. Generalizations. Some natural directions for generalizing these results would be the follow- 
ing. First, as stated, the Szemeredi condition is not a priori meaningful for infinite sets, while the 
condition (f) from Conclusion 13.71 is meaningful, and we can generalize the results of this section 
replacing n,ni,n2,k by infinite Xq, Xi, X2, n. Second, we can allow G to be a directed graph and 
replace {xRy} with a set $ of binary relations satisfying the non-A:,,, -order property. Third, we can 
replace 2-place by n(*)-place where n(*) < w, so $ is a set of formulas of the form (p{xo, . . . Xn-i), 
n < n(*). In this case, the assumption (A)(1) of Conclusion 13.71 becomes 

which can be justified by appeal to one of the following: 

(1) by Ramsey: Ai ^ (As)^"!*^ 

(2) by Erdos-Rado, similarly 

(3) using Erdos cardinals 

or 

(4) use stability: [13] Chapter II, or better (in one model) [?] §5. 

Finally, it would be natural to consider extending the results above to hypergraphs. 

3.2. Remarks on dependent theories. In this brief interlude we discuss some extensions of ^ 
to the more general class of dependent graphs. Definition 13. Hi In subsequent sections, we return 
to stable graphs. Although, as discussed in the introduction, the order property is enough to cause 
irregularity, many of the properties considered in this paper are applicable to dependent graphs, 
e.g. Claim [231 and Fact [5lnl 

Dependent theories (theories without the independence property, see below) are a rich class 
extending the stable theories (theories without the order property), and have been the subject of 
recent research, see e.g. p^j and [5j. From the point of view of graph theory and combinatorics, the 
Vapnik-Chervonenkis connection [9] makes this a particularly interesting class. Below, we indicate 
how bounds on alternation can be used to easily deduce a weaker analogue of Theorem [ 



Hypothesis 3.10. G is an ordered graph (or a graph) meaning that it is given by an underlying 
vertex set on which there is a linear order <^ , along with a symmetric binary edge relation R. We 
assume that G is k^-dependent. 

Definition 3.11. Let < uj be given. We say that G is /c*-dependent when there are no a£ G G 
(for i < k^) and bu £ G (for u C k^) such that aiRbu iff i € u. 

Remark 3.12. Stable implies dependent, i.e. if xRy does not have the order property it will not 
have the independence property; but the reverse is not true. More precisely, the formula xRy has 
the order property if, for every n < u, there exist elements oq, . . . such that for all i <n, 

\= (3x) /\ ^{xRaj) A /\ xRoj 

\j<i j>i 

(note this definition remains agnostic about the existence of an x connected to some partition out 
of order) whereas the formula xRy has the independence property (=is not dependent) if, for every 
n < oj, there exist elements ao, ■ ■ - an such that for all u n, 

\= {3x) I /\ -^{xRaj) A /\ xRaj 
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We first discuss the results of ^ On tlie infinite partition theorem for dependent T (existence 
of indiscernibles), see for negative results, and [14J for positive results. 

Notation 3.13. (1) If A Q G then let mem^(^) be the £th member of A under <^ , for I < \A\ 

(for infinite A such that \a is well ordered, £ < (otp(^), \a), an ordinal) 
(2) If AC G, <^ is a well-ordering of A then mem.A{£) is defined similarly. 

Definition 3.14. (Compare Observation 13.61 ) 

(1) We say a pair {A, B) of subsets of G is half-f-nice (for f= (/i,/2)j when: 
for all but < fi(\A\) members of A, 

for all but < f2i\B\) numbers i < \B\ (or otp(i?)), we have that 
aRmeuiBii) = aRmemB{i + 1). 

(2) (restated for clarity:) If f = (ci,C2) where ci,C2 are constants, then in (1) replace the 
condition "all but < fi{\A\) members of A" with "all but < Ci members of A for i = 1,2. 

(3) We say a pair [A^B) of subsets of G is f-nice when {A,B) and {B,A) are both half-f-nice. 

(4) If G is just a graph then the above A,B should be replaced by {A,<a), {B,<b). 

Definition 3.15. Let A^^ = {ipri{xQ, . . . Xk,-i) : ?] € ^*2} where 

(^,(xo, . . . Xfc^i) = {3y) /\ {xeRyyf ^^=1 

£<fc. 

Claim 3.16. Suppose A,BcG are disjoint, both A,B are A^^ -indiscernible sequences, and \ A\ > 
2k^, \B\ > 2k^. Then {A^B) is {\,k^)-nice. 

Proof. Suppose not, so without loss of generality {A^B) is not half-(l, A;=K)-nice. So there is o E ^ 
which "alternates" times on B. That is, we may choose a < '^-increasing sequence of elements 
bzQ, . . . bi^^^ C B such that aRbj (for j even) and ^aRbj (for j odd). Let us verify that this means 
G is /c* -dependent. Let J = {jo, . . ■ jk*-i) be any set of indices of elements of B, of size A;*. Then 
for any cr ^ J, by indiscernibility, we have that 

yea £gj\cr 

since this is true when the appropriate increasing sequence of A;*-many indices (corresponding to 
the pattern of membership in a) is chosen from among io, . . . ^2fc*-i- D 

Definition 3.17. Let A = {Ai : i < i{*)) be a set of subsets of A C G (usually pairwise disjoint). 

(1) We say that A is half-f-nice when i < j < i(*) implies {Ai,Aj) is half-f-nice. 

(2) We say that A is f-nice when i < j < i{*) implies {Ai,Aj) is f-nice. 

Claim 3.18. Assume that 

(1) G is an ordered graph and is k^-dependent 

(2) mi — )• (m2)-j^*^ I in the sense of Ramsey's theorem 

(3) AcG, \A\ = n 

Then we can find {Ai : i < i(*)) such that: 

(1) \Ai \ = 7712 for all i 

(2) the AiS are pairwise disjoint 

(3) each Ai C A and is either complete or edge free 

(4) B = A \ \J{Ai : i < i{*)} has < mi members 
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(5) each Ai is A^, -indiscernible 

(6) A is (1, k^)-nice 

Proof. Straightforward. □ 

Conclusion 3.19. Continuing with the notation of Claim [^.18[ let {Ai : i < be the partition 

of A C G, \ A\ = n into pieces of size m2 obtained there. Suppose that m^\m2, and suppose that 
we divide each A-i into convex intervals Aij each of length m^. If m2 > then we obtain an 

equitable partition into ^ pieces in which, moreover, for any «i 7^ «2 < K*)) 

(1) {{jij2) ■ i^hji, Ai2j2) neither full nor empty} has cardinality < 

(in each pair, each elements alternations can he seen in at most other pairs) 

(2) m,j2):ji,j2<m={^y 



(3) so the density of bad pairs is < 



m2 



3 



2 

On the other hand, the density of pairs with ii = 12 is < 

Remark 3.20. Here we obtain quite small pieces (coming from Ramsey's theorem) and there are 
exceptional pairs; but for the regular pairs there are no exceptions. 



4. On the bounds 

In this section, we take a different approach, aimed at improving the bounds on the number of 
components. First, in a series of claims, we give conditions for partitioning a given graph with 
the non-fc-order property into disjoint e- indivisible sets (Definition 14. 2p . and show when such sets 
interact uniformly. However, the procedure for extracting such sets does not ensure uniform size 
(Discussion 14. 1 1 ( ) . We solve this in two different ways. The first (probabilistic) approach, resulting 
in Theorem l4.16l gives a partition in which there are irregular pairs, but the "regular" pairs have no 
exceptional edges. The second, resulting in Theorem 14. 221 proceeds by first proving a combinatorial 
lemma B. 191 which allows us to strengthen the "indivisibility" condition to one in which the number 
of exceptions is constant; thus in Theorem 14.221 there are no irregular pairs, at the cost of a 
somewhat larger remainder. 

As mentioned in the introduction, one recurrent strategy in this paper is partitioning a given 
graph into "indivisible" components; compare Definition 14.21 with Definition 15.21 Refiecting the 
strength of Definition 14.21 the number of pieces in each of the two partition theorems of this section 
grows with the size of the graph, as in Theorem 13. 8i In Section [5l under the weaker Definition 15.21 
the number of pieces in the partition will be a constant c = c(e) as in the classical Szemeredi result. 

Hypothesis 4.1. Throughout we assume: (1) G is a finite graph. (2) G has the non-k^,-order 
property, and so A;*=k is the corresponding tree-height bound from Definition \2.9i (3) By convention 
f,g are nondecreasing functions from N to N \ {0}. 

Definition 4.2. (e- and /-indivisible) 

(1) Let e € (0, We say that A C G is e-indivisible if for every b £ G, for some truth value 
t, the set {a €z A : aRb = t} has < 1^1*^ members. 

(2) In general, we say that A is f -indivisible (where f : oj ^ oj) if for any b € G, there exists 
a truth value t such that \{a A : aRb ^ t}| < /(|A|). By convention in this section, we 
assume that f is nondecreasing. 
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Claim 4.3. Assume that ttiq > • • • > m^.^^ is a sequence of nonzero natural numbers and for all 
£ < A;**, f{mt) > m^+i (e.g. f{n) = n^). If A C G, \A\ = mo then for some £ < k^^^ there is an 
f -indivisible B G [A]^^ . 

Proof. Suppose not. So we will choose, by induction on k < k^^, elements {b^ : r] ^ ^^2) and 
(A^ : rj e''-2) such that: 

(1) Ao = A 

(2) A^^i^i^ c A^ 

(3) A^.(o>n V(i> = 

(4) \A^\ = mig(^) 

(5) b^GG 

(6) ={a£Ar,: aRb^ = = 1)} 

There is no problem at A; = 0, but let us verify that the induction cannot continue past A;**. 
For each ry G ^^2 A^ / by (4), so choose G A^. If for all r/ G ^**>2 there exists 6,, such 
that A^^ (^i^ , A^^ 1^2) ^'^^ defined and satisfy the conditions, then the sequences (a.^ : r] G ^"2) and 
{brj : rj G ^**>2) contradict the choice of /c**. Definition 12. 9i So for at least one 77, it must be that no 
such brj can be found in G, i.e that for any b £ G, either |{a G A^ : aRb}\ or |{o G : -^aRb}\ is 
less than m^+i, for £ = Ig(ry). Let B = A^, so \B\ = mc and B is /(|S|)-indivisible, which completes 
the proof. □ 



Claim 4.4. Assume f, {mi : £ < A;**) are as in Claim For any A G, we can find a sequence 
{Aj : j < m) such that: 

(a) For each j, Aj is f -indivisible 

(b) For each j, \Aj\ G {m^ : £ < k^^^} 

(c) AjCA\ [j{Ai : i < j} 

(d) A \ [j{Aj : j < m} has < rriQ members 

Proof. We choose Aj by induction on j to satisfy (a)+(b)+(c). If |A| < niQ we are trivially in case 
(d). By Claim HT3l we can continue as long as there are at least tuq elements remaining. □ 

Claim 4.5. Assume e G (0, gjiR; n''- > A;**. Let [ruf : < £ < A;**) be a sequence of integers 
satisfying n > niQ, m^^^ > k^,^, and for all £ s.t. Q < £ < A;**, m^+i = [(m-^)'^] . 

If A Q G, \A\ = n then we can find A such that: 

(1) A = {Ai : i < i{*)) is a sequence of pairwise disjoint sets 

(2) {\Ai\ : i < is <-increasing 

(3) for each i < i{*) for I = mi and Ai is e-indivisible 

(4) A \ {Ai : i < i(*)} has < m^ elements 

Proof. By Claim 231 using /(n) = n'^ and renaming the sets Ai so that clause (3) holds. □ 

The next claim says that for all sufficiently indivisible pairs of sets, averages exist (notice there 
is a potential asymmetry in the demand that B be large). 

Claim 4.6. Suppose A is f -indivisible, B is g-indivisible and and f{\A\) ■ g{\B\) < ^\B\. Then for 
some truth value t = t{A, B) for all but < f{\A\) of the a G A for all but < g{\B\) of the b £ B, we 
have that aRb = t. 

Proof. Similar to the proof of Observation 13.61 above. For each a G A there is, by ^-indivisibility 
of B, a truth value = ta{a,B) such that \{h £ B : aRb = ta}\ < g{\B\). For i G {0,1}, let 
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Ui = {a €z A : ta = i}. If \Ui\ < /(|^|) for either i, we are done, so assume this fails. Choose 
Wi C Ui so that \Wi\ = f{\A\) for i G {0,1}. Again we gather the exceptions: let V = {b e B : 
(3a G Wi)i^aRb) V (3a G Wo)iaRb)}. Then |y| < {\Wi\ + |VFo|)ff(|-B|) < \B\ by hypothesis, so we 
may choose 6* G \ But then a G Wi b^Ra and a G Wq =^ -^b^Ra, contradicting the 

/-indivisibility of j4. □ 

Remark 4.7. When f{n) = n'', g{n) = the translated condition is: if \A\''\B\'^ < ^\B\. 

Claim 4.8. Let A be (^-indivisible and B be e-indivisible. Suppose that the hypotheses of Claim 
are satisfied, so averages exist. Then for all Ci G (0, 1 — C), ei G (0, 1 — e), we have: if A' C A,B' C B, 

\A'\ > \B'\ > \B\'+'\ then: 

{{a,b) e {A',B') : aRb = ^t{A,B)} 



\A'\\B' 



< 



\A\<^ 



+ 



Proof. To bound the number of exceptional edges, recall that in A, hence also in A' , there are at 
most \A\^ elements which do not have the expected average behavior over B. Likewise, for each 
non-exceptional a ^ A' there are no more than \B\^ corresponding exceptional points b & B'. Thus 
we compute: 



\A\i-\B' 



\A'\ - \A\<)\B\ 



\A\<^ 



lA'WB' 



A' 



+ 



\A'\ - UKl 



A' 



\A\^ \B\ 



< 



1^1 



C 



\A'\ ' \B'\ - 

A similar result holds for /-indivisible replacing e-indivisible. 



\B\ 



+ 



\B\ 



\A\Ci 



+ 



□ 



We single out the following special case for Theorem 14.221 below. 
Corollary 4.9. Let A,B be f -indivisible where f{n) = c is a constant function. Suppose that the 



hypotheses of Claim \4^ are satisfied, so averages exist. Then for all Ci G (0, 1- 
we have: if A' cA,B'c B, \A'\ > c|A|fi, \B'\ > c\B\^\ then: 



\A\ 



),ei G (0,1- 



\B\- 



{{a,b) (£ iA',B') : aRb = ^t{A, B)} 



\A'\\B' 



< 



|v4|Ci 



+ 



1 



Shi 



Returning to the general argument, choosing ei (here called C) small enough means we can apply 
Claim 14.81 to any pair of elements from the partition in Claim 14.51 



Claim 4.10. In Claim "^f C ^ (0,e ), we have in addition that for every i < j < i{*), if 
A C Ai, \A\ > \Ai\'^^'^ , B C Aj, \B\ > \Aj\^^'^ and tij = t{Ai,Aj) is the associated truth value, then 



{(a, 6) G iA,B) : aRb = ^Uj} 



\A\\B\ 



< 



\Ai\C 



+ 



A,\C 



< 



+ 



A\<: ' \B\C 



Proof. By Claim HTSl Note that the enumeration along with clause (2) of Claim [^31 (i.e. \Ai\ < \Aj 



ensures tij is defined. 



□ 



Discussion 4.11. In some respects Claim^JjJ^ applied to the partition of Claim \4^ is quite strong: 
(a) There are no irregular pairs, (b) For each pair the number of exceptional edges is very low. On 
the other hand: (c) There is a remainder A\\^-Ai, not serious as we can distribute the remaining 
elements among the existing Ai without much loss, as was done m 321 (d) There is an inherent 
asymmetry: the result assumes i < j < we have not discussed j <i < but this is also not 
serious, (e) The cardinalities of the Ai are not essentially constant: this seems more serious. 
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We give two different resolutions of (e) in the remainder of this section. In Theorem \4-16 



we 



obtain an equitable partition at the price of allowing for irregular pairs. In Theorem \4.22[ we obtain 
much stronger indivisibility conditions on the components and no irregular pairs, at the price of a 
somewhat larger remainder, Theorem \4.22 



4.1. Towards a proof of Theorem 14.161 

Definition 4.12. Assume that A, B are f -indivisible (usually: e-indivisible) , disjoint (for notational 
simplicity), and that f{A) x f{B) < ^\B\ (so t{A,B) is well defined). Let m divide \A\ and \B\. 

We define a probability space: divide A into \A\/m pieces each of size m {Ai : i < i^) and likewise 
divide B into \B\/m parts each of size m, {Bj : j < Jb)- Call this partition an equivalence relation 
E on AUB. 

For each i < iA,j < Jb, 1st St, A.^be the event: for all a E Ai, for all b G Bj, aRb = t{A, B). 

Claim 4.13. Let Ai,Aj be two sets from the conclusion of Claim So e ^ (0, f{x) = [x""]. 
Ignoring a minor error due to rounding to natural numbers, suppose that \Ai\ = m^^ = n'^ " , 
\Aj\ = = n*^ and \Ai\ < \Aj\. Let m be an integer such that m divides both \Ai\ and \Aj\, 
and m = for some ( < e^** . Choose a random partition of Ai and Aj into pieces of size m. Let 
Al , j4* be pieces from Ai and Aj , respectively, under this partition. 

Then Prob(^+ , )>1-^. 

Proof. By choice of Ai,Aj we have that t = t{Ai,Aj) is well defined. Let lAi = {a ^ Ai : \{b € Aj : 
aRb = -it}[ > and for each a £ Ai \ Ui, let U2,a = {& € Aj : aRb = -it}. By definition of t, 

1^1 1 < l^il*^ and for each relevant a, \U2,a\ < l^il*^- 

We first consider Af. The probability Pi that Af DUi 7^ is bounded by the following: 

mm n'^lAil' "^^f^' 



Ai\-m ' \Ai\-m " n"'""^^ 

1 _ 1 

^ _ _l_1 e _ _l_0 r» A _ _L1 /-I ^ n A ^ _ _l_1 ^ 



Now if n Z^i = then |UapA=^2,a| < So the probability P2 that we have A* n 

UaeAf ^2,a / is bounded by: 

"T-l UaGA^^2,a| m ■ m ■ \AA'' n^^\AA^ 1 

P2 < — I < — , ' ^' < 1 < 

\Aj \ - m \Aj\- m \Aj \ — ** 



by the analogous calculation. So Prob(i£^^s ^) > (1 ^1 ^ 

Claim 4.14. Let {mi : £ < /c**) be a sequence which satisfies the hypotheses of Claim l475\ and 
suppose that m^:^ divides mi for i < /c** . Let n be sufficiently large relative to m* : it suffices that 
"i* < (^^^ Remark \4. 15\ ). 

Let A C G, \A\ = n and let {Ai : i < i{*)) be the partition of A given by Claim \4^ with respect 
to the sequence {mi : d. < (we will temporarily ignore the remainder of size < m^). Recall that 
t{Ai,Aj) is well defined for i < j by Claim 

Then there exists a partition {Ci : i < r) of Uj<i(*) ^« such that: 
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(1) {Ci : i < r) refines the partition {Ai : i < i{*)) 

(2) \Ci \ = m^^ for each i < r 

(3) For all but at most J.^^ of the pairs {Ci,Cj), there are no exceptional edges: that is, if 
i < 3, Ci C Ai, and Cj C Aj, then {{a, 6) G Cj x Cj : aRb ^ t{Ai, Aj)} = 0. 

Proof The potential irregularity in a pair {Ci,Cj) comes from two sources. 

(a) The case where Ci C Ai, Cj C Aj, i ^ j and {Ci,Cj) contains some exceptional edges. By 
Claim [1?T3] and linearity of expectation, there exists a partition satisfying (1),(2), in which (3) holds 
when computed on pieces Ci, Cj which came originally from distinct components Ai,Aj. In fact, 

this will be true for all but at most (1 ^ttt)^ of such pairs by the calculation in the last line of 

Claim ma 

(b) The case where Ci,Cj are both from the same original component Ai. Here we have no 
guarantee of uniformity. Let us compute a bound on the fraction of such pairs Ci, Cj. The maximum 
is attained when all of the original components were of maximal size mo = rn^,; in this case the 
number of ways of choosing a pair Ci, Cj from the same original component is at most 

m** — q£ ^ possible I "^** 

so the ratio is approximately 

(—Y 

2 rrt* ^* 

{ n 
2 

Recall that by hypothesis, — < —4 — . 

Combining (a) and (b), the total fraction of irregular pairs does not exceed J,, . □ 

Remark 4.15. In Claim^JjJ^ hypothesis on m* could obviously be weakened, or dropped at the 
expense of increasing the fraction of irregular pairs by as the calculation in part (b ) of proof 
shows. 

Theorem 4.16. Let e = i E (0, /c* and therefore /c*^, be given, and suppose G is a finite graph 

with the non-kit: -order property. Let A <Z C, \A\= n with > A;**. Then there is C < e CLn-d a 

partition {Ai : i < i{*)) of A such that: 

(1) for all i, either \Ai \ = [n^J or \Ai\ = 1 

(2) \{i : \A,\ = 1}| < n" 

(3) ^ > 77(W |{(^,J) : l^^l = 1, l^il = 1, or {{a,b) e Ai x Aj : aRb} i {A, x ^„0}}j 

Moreover, we may choose C ^ (1 — 2e^**)e'^**^^, so the total number of pieces is at most n'^ 

where c = c(e) = 1 - e''**+^ - 2e^''**+^ 

Proof. Recall that e = -. (This hypothesis is just to ensure divisibility, and could be modified or 

fc** 

dropped in favor of allowing for slight rounding errors.) Choose m*=K maximal so that (m**)'' < n, 
and subject to the constraint that ^ < (One can drop this constraint, by Remark 14.151 at 

the cost of increasing the fraction in item (3) by ^.) By hypothesis m** > fc*^,. Then the sequence 
{me : £ < /c*^,) satisfies the hypotheses of Claims HT3] and HTS} and furthermore m^,* divides me for 
£ < Apply Claim [^5] to obtain a decomposition into e-indivisible pieces A'- such that for each 
i and some £, \A'- \ = m^. Claim IT. 141 gives a further partition into pieces {Ai : i < i{*)) each of size 
m**; additionally, we partition the remainder from Claim 113] into pieces of size 1. Let be such 
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that m^,* = n^. This gives clause (1), and clause (2) holds by Claim H31 fd) since mo = (m-**)^ ** • 
Condition (3) holds by Claim [¥.131 Finally, 

□ 

Remark 4.17. Though the number of components grows (solved only in and this regularity 
lemma admits irregular pairs, the regular pairs have no exceptional edges. 

4.2. Towards a proof of Theorem I4.22L In this subsection we take a different approach, and 
obtain a regularity lemma in which there are no irregular pairs, at the price of a somewhat larger 
remainder. The strategy will be to base the partition on a sequence of c-indivisible sets, i.e. sets 
which are /-indivisible for a particular constant function /(x) = c; such sets will then interact in a 
strongly uniform way. [Recall from Definition 14 . 2 1 that e- indivisible for e G (0, 1)r was shorthand for 
/-indivisible when /(x) = x^; this was the only exception to standard notation, and in particular, 
c-indivisible for c € N means /(x) = c] The proof that such sets exist relies on a combinatorial 
lemma 14.191 To motivate the combinatorial lemma, the reader may wish to first look through the 
proof of the existence claim, Claim 14.211 

Definition 4.18. For n, c G N, e,C,^ ^M. let 0[n, e, C, c] be the statement: 
For any set A and a family V of subsets of A, we have 
If (l)\A\=n^ 

(2) \V\<nKv <Z V{A) 

(3) {yB G V){\B\ < n') 

then there exists U Q A, \U\ = [n^\ such that (VS e V) {\U Ci B\ < c). 
Lemma 4.19. If the reals e, (^,^ and the natural numbers n,c satisfy: 

(a) eG (0,1), C>0 

(b) < C < min(l -e,i) 

(c) n sufficiently large, i.e. n> n(e,C,^,c) from Remark \4.20 

(d) c > g(i„\,,) 

i/ien 0[n, e, C, c] holds. 
We delay the proof until after the next claim. Note that in clause (d) we have that c > by (b). 

Remark 4.20. In the statement of Lemma \4-l^ for "n sufficiently large" it suffices to choose n 
such that 



„1-2C ^ ^(l_^_e)c-l/C ^ 

See the last displayed equation in the proof of Lemma \4.19[ As explained there, the hypotheses of 
Lemma \4 ■ 1 9| imply that the two exponents are positive constants, so it is well defined to let n(e, C, ^, c) 
be the minimal n G N for which the displayed equation is true. 

Claim 4.21. Suppose that we are given constants /c,c G N and e,^,C £ such that: 

(1) A<ZG implies |{{a G A : aRb} : fe G G}| < l^]'^' 

(2) 6G(0,i) 

(3) ^ G (0, ^) is such that £, < e^** and for all natural numbers i < /c**, ^ < ^ < 1 — e. 
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(4) the constant c satisfies 



c > 



C(i 



Then for every sufficiently large n G N (meaning n > n(e,C,^, c) in the sense of Remark \4. 20^ if 
A G with \A\ = n then there is Z <^ A such that 

(a) \Z\ = [n^J 

(b) Z is c-indivisible in G, i.e. for any b £ G there zs t G {0, 1} such that for all but k elements 
a £ Z, aRb = t 



Proof. Since G has the non-A:*-order property (see the second paragraph of l2.13p . k = k^^ will satisfy 
condition (1) by Claim 12.41 and recall that k^^ is the associated tree bound from 12.91 For lower 
bounds on the size of n, see Remark 14. 20[ 

Let ^ C G, 1^1 = n be given. For transparency of notation suppose that for each natural number 



^ k^ 



n 



G N, and that G N. We choose mi by induction on £ < /c** so that m^. 



+1 



By Claim there is £ < A;** and Ai ^ A such that \Ai\ = mi and Ai is e- indivisible. Let 
Vi = {{a G Ai : aRb} ibeG}. So \Vi \ < \A\'' = (m^)'=, by choice of k. 

We would like to apply Lemma [4.191 to conclude that 0[e, -I, ^,c] holds ioi A = Ai, V = Vi. 
Let us verify that the hypotheses of that Lemma hold: 



• (1),(2) hold as j^i| = me, and l^il = (me)" 

• (3) holds by definition of Vi, as Ai is e-indivisible 

• (a) clear 

• (b), (d) by choice of ^ and c in this Claim 

• (c) by choice of n "sufficiently large" 



1 

me) 



We conclude that there is Z C Ai (i.e. the lA guaranteed by Lemma l4.19p which is c-indivisible 
and satisfies 



\Z\ 



(meY' 



which completes the proof. 



□ 



We now prove Lemma |4.19[ 

Proof, (of Lemma |4.19|) Let m = \_n^\ ; this is the size of the set U we hope to build. 

Let J-"* = "^A be the set of sequences of length m from A, so = n"*. We will use rj for such a 
sequence and write ri[i] for the value at the ith place. 

Define a probability distribution fi on T Q by: /i(-F) = 

We will show that for n > n(e,C,^,c) in the sense of Remark 14.201 there is nonzero probability 
that a sequence r/ G J-"* satisfies (1) all the elements of t] are distinct, set it has cardinality 

m and (2) for any B £ V there are fewer than k integers t < m such that r][t] G B. This will prove 
the lemma. 

We calculate the relevant probabilities in four steps. 

®i Verifying some inequalities. By assumption (b) of the Lemma, 1 — 2^ > and 1 — ^ — e > 0. So 
by assumption (d) (1 — ^ — e)c— ^>0 and c is a natural number. We proceed to compute several 
probabilities. 
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®2 The probability that rj is not sequence of distinct elements. (This is bounded by the sum over 
s < t of the probabiUty that 7][s] = i][t]: note we don't mind if this happens for more than one pair.) 

VToh{{3s<t<m){ri[s\=ri[t\)) <[]<<< -^-^ < 



®3 The probability that rj intersects a given B in more than c places. (For the bound, choose 
c indices, then choose c values for those places from B, over all possible choices of those values.) 
Let B £ V he given. Then 

Prob {{3^H < m 7? t G S < ^- < < = ^TTTk 

\c ) n'^ n(^~'i~'^>'^ 

®4 The probability that rj intersects some B in more than c places. By ®3, 

Prob {{3B € V){3-H < m){rj[t] G B)) < \V\ ■ (max{Prob < m)(r?[t] e B)) : B e V}) 

1 1 1 
< nc • 



As remarked above, it suffices for the Lemma to show that the sum of the probabilities ®2 + ®4 < 1, 
i.e. that 

1 1 

By ®i, both exponents are nonzero, and moreover they are constant, so the sum will clearly even- 
tually be smaller than 1. □ 

Theorem 4.22. Let and therefore /c** be given. Let G be a graph with the non-k-t- order property, 
and let k = k^ as in the proof of Claim \4^.21\ 

Then for any c G N and e,C G M which, along with k, satisfy the hypotheses of Claim \4T21 , and 
any eeR,0<e<l, 

there exists N = N{k^,, k, c, e, C, such that 

for any ACQ, |^| = n > A^, there is i{*) G N and a partition {Ai : i < of A into disjoint 
pieces (plus a remainder) satisfying: 

(1) \Ai \ = [n^^\ for each i < i{*) 

(2) each Ai is c-indivisible, i.e. indivisible with respect to the constant function f{x) = c 

(3) !^\U<^w^il< [n^' 

Remark 4.23. Recall that the interaction of any two distinct Ai,Aj given by this theorem will be 
highly uniform. Assuming > 2c, average types exist in the sense of Claim and in particular 
the calculations of Corollary \4.9\ apply. 

Proof, (of Theorem 14. 22p Assume that n is large enough so that > n(e, C,^,c) + 1, where n(...) 
is the lower bound from Lemma l4. 191 and Remark 14.201 Note that by choice of k, k satisfies Claim 
[221 

We are aiming for pieces of uniform size n^''. First, given 6, define by induction a decreasing 
sequence {me : £ < A;**) by ruk^^-i = \n^\ , ruk^^ = \ {mk^^-iY \ and for each 1 < j < /c**, mk„-j = 

(mfc_^_^_j+i)~ . This sequence, fixed for the remainder of the proof, satisfies the hypotheses of 
Claim Ol 
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Second, choose a sequence of disjoint c-indivisible sets Ai by induction on i, as follows. Let 
Ri denote the remainder A \ Uj<j^i stage i. Apply Claim to Bi, using the decreasing 
sequence {m^ : £ < A:*^,) just defined, to obtain an e-indivisible Bi C R^. By construction, for some 

ft ^ 

1 < £ < k^^: this set Bi will have cardinality mk„-i = (n )<!*"i . (Note that e-indivisibility need 
not be preserved under taking subsets.) 

By the first line of the proof (recall ruk^^-i = \nP\), we have \Bi\ > n(e,C^,c). Apply Claim 
14.211 to Bi, using c,k,e,C,^ as given, to extract a c-indivisible subset Zi of size \Bi\'^. That IS, 



\Zi 



n 



< 



for some I < £ < A;** 



Since the property of being c-indivisible is 

This 

completes the construction at stage i. 

This construction can continue as long as the remainder Bi has size at least mg 



preserved under taking subsets, choose Ai to be any subset of Zi of cardinality exactly \n^'^ 



required by Claim 231 the remainder must have strictly smaller size, which completes the proof 



7^ 



as 
□ 



5. Regularity for stable graphs 



Thus far, we have given several regularity lemmas for the class of stable (or, in Section 13.21 
dependent) graphs which in some senses improved the classic Szemeredi result, particularly in the 
"indivisibility" of the components; however, in each case the size of the partition given depended on 
1^1 . In this section, we obtain a partition theorem for any graph G with the non-A;,, -order property 
which unilaterally improves the usual result, Theorem l5.18l for each e, there is m = m(e) such that 
all sufficiently large G with the non-A;,, -order property admit an equitable distribution such that 
(1) there are no irregular pairs, (2) each component satisfies a strong indivisibility condition, called 
e-excellence, and (3) the bounds are much improved. For most of the construction, "regularity" of 
pairs means e-uniformity. Claim [5^6] below: this is useful in our context as the density will be close 
to or 1. A translation is given in Claim [5TT71 and Corollary 15.191 is a slightly weaker statement of 
the main result using the familiar definition of e-regularity. 

This section relies on ^ (Preliminaries) for notation and definitions; nonetheless, definitions will 
be referenced the first time they are used. Although this section naturally extends the results and 
strategies of previous sections, it is self-contained and can be read independently. 

Hypothesis 5.1. Throughout 33 we assume: (a) G is a finite graph, (b) for some fixed through- 
out this section, G has the non-k^,- order property. Definition \2.2\ and (c) A;** is the corresponding 
bound on the height of a 2-hranching tree, Definition \2.9l Throughout this section e,C,,£, o.re reals 
G(0,i). 

Definition 5.2. (Good, excellent) 

(1) We say that A C G is e-good when for every b G G for some truth value t = t(6. A) € {0, 1} 
we have \{a ^ A : (aRb) ^ t}| < e\A\. As e < ^, this is meaningful. 

(2) We say that A G is (e, C)-excellent when 

(a) A is e-good and moreover 

(b) if B C G is C,-good then for some truth value t = t{B,A), 
\{aeA: t{a,B) / t(B,^)}| < e\A\. 

Again, as e < ^ the average is meaningful. When e = C, we will just write e-excellent. 

Remark 5.3. Any set A C G satisfying condition (b) for e-excellence must also be e-good, since any 
singleton set {6} is clearly e-good (in fact, e-excellent). Any B which satisfies (Va G G) Vte{o i}(^^ ^ 
B){aRb = t) will also be excellent. 
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The next claim, which will be used repeatedly, gives a way to extract e-excellent subsets of any 
given A by inductively building a tree whose (full) branching must eventually stop. In the statement 
of the Claim, Case (II) abstracts from Case (I) by assigning cardinalities to the levels of the 
tree. 

Claim 5.4. (Crucial claim) Assume e < 

(I) For every A C. G, \A\ > there is A' such that: 

(a) A' QA 

(b) l^'l > e''**-^A\ 

(c) A' is e-excellent 

(II) Alternately, suppose we are given a decreasing sequence of natural numbers (m^ : £ < k„) 
such that em£ > m^+i for i < k^^^ — 1, and rrik^^-i > k^*- Then for every A C. G, \A\ > 
there is A' such that (a),(b)',(c)' hold, where: 

(b) ' l^'l = rui for some £ < k^^ 

(c) ' A' is -excellent {so in particular, e-excellent) 

Proof. The strategy is as follows. Since the proof is essentially the same for Cases (I) and (II), we 
prove both simultaneously by giving the proof for Case (I), and pointing out when the cases differ. 
We will try to choose {A^, B^) by induction on A; < A;** such that: 

(1) 5 = (^^ : r? G ^=2) 

(2) ydfc is is a partition of A, or of a subset of A 

(3) ^0 = A 

(4) If k = m + 1, u £ "^2 then A^, is the disjoint union of Aj^^ , ^^^^ 

(5) \Ari\ > e''\A\ for 77 G ^=2 

or In case (II): > m^, with equality if desired 

(6) Bk = {By : G ^>2) (note that B^ is defined at stage + 1) 

(7) Each C G is e-good 

or In case (II): B^ is "!^"'"^ -good 

(8) for all T} G ^~^2, a G ^,,-(0) implies t(a, S^) = and a G ^r)^{i) implies t{a,B^) = 1. 

Note that t(a, -B^) is well defined in (8) as is good. When A; = 0, define ^4^^ = A. Now suppose 
= m + 1. In Case (I), suppose that for all 77 G "^2, Aj^ fails to be e-excellent. By definition, for 
each such r/, there is some set Bfj C G which is e-good and such that 

\{a G Ar; : t{a,B) / 1}| > e|A,,| and \{a G A, : t{a,B) ^ 0}| > e|A^| 

again noting that these two sets partition Aj^ by goodness of B^j. So we can define A^^^j^^ := {a G 
Ar, : t(a, Brj) = i} for i G {0, 1}. 

Meanwhile, in case (II), we are interested in whether is -excellent rather than e-excellent; 
if not, there an "^^^ -good set Br^ such that the displayed equation holds with "> "^^^ in place 
of "> e|A^|". In this case, choose ^r,-(i) to be a subset of {a & A^^ : t(a, 5^) 7^ i} of cardinality 
nik+i, for i = 0, 1. 

This completes the inductive step, and satisfies conditions (l)-(8). 

We now show that the induction cannot continue indefinitely. Suppose we have defined Aj^ for 

r] G ''**2 and B,y for v G ^**>2 satisfying (l)-(8). For each 7], since we assumed cither (I) e^** \A\ > or 
(II) that l^^l = m£ > m.fc^^„i > /c**, we have that A^^ 7^ so we may choose a,, G Aj^. Furthermore, 
for each v G ^**>2 and r] £ ^'"2 such that u <r], we may define 

Uy^ri = {b£By: (a^Rb) ^ t{a^,B^)} 
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i.e. the set of elements in B^i which do not relate to in the expected way. By assumption 
"^^^ < e, so in both Cases (I) and (II), \Uu,r]\ < ^\Bu\ by the goodness of B^. Hence for any such v, 



< 2''"e\BJ < \BJ 



by the hypothesis on the size of e. In particular, for each v G ^**>2 we may choose an element 
^ B^ \ y]{Uu^n V <r] ^ ^**2}. Now the sequences (ur) : r) & fc**2) and {hy : v ^ ^**>2) contradict 
Definition 12.91 i.e. the choice of A;**. 

We have shown that for some k <^ /c^^ the induction must stop. Hence for some G ^2, is 
e-excellent [if in case (II), is '"''^^ -excellent, so in particular e-excellent] and satisfies condition 
(5), which completes the proof. □ 

Remark 5.5. Note that the tree construction just given naturally tends away from uniform size 
since we do not know when or where the induction will stop. 

By definition, if A is e-excellent and B is C-good, they will interact in a strongly uniform way, 
namely, most of the elements of A will have the same average t{a,B) G {0, 1} over B. Let us give 
this a name: 

Claim 5.6. If A is e-excellent and B is C,-good then the pair [A,B) is (e, (^)-uniform, where we say 
that [A^B) is (e, C)-uniform if for some truth value t = t{A,B) G {0, 1} we have: for all hut < e\A\ 
of the elements of A, t{A,B) =t{a,B). 

In other words, for all but < e\A\ of the elements of \A\, for all but < C\B\ of the elements of B, 
(aRb) = {t{A, B) = 1). When e = C, we will just write e-uniform. 

Proof. By the definition of excellent. □ 

Remark 5.7. So in some ways "{A,B) is {e, e) -uniform" is stronger than being e-regular; see also 
Claim 5.17 below. 

Discussion 5.8. At this point, we have a way to obtain e-excellent subsets of any given graph, 
whose sizes vary along a fixed sequence. Below, we will extract a collection of such sets as the first 
stage in obtaining a regularity lemma. However, the goal is a partition into pieces of approximately 
equal size, which will require an appropriate further division of the first-stage collection of e-excellent 
sets. In preparation, then, we now apply several facts from probability to prove that sufficiently large 
e-excellent sets can be equitably partitioned into a small number of pieces all of which are e' -excellent 
for e' close to e. 

Fact 5.9. Assume p,q > 0. If \A\ = n, B C A = p, m < n, ^ > q, A' is a random subset of A 
with exactly m elements, then 

^ , /\A'nB\ /\B\ ^ \B\ ^ 

can be modeled by a random variable which is asymptotically normally distributed. 

Proof. That is, our hyper geometric distribution (sampling m elements from a set of size n without 
replacement) will be asymptotically approximated by the binomial distribution (sampling with 
replacement), and therefore by the normal distribution. See Erdos and Renyi [2j p. 52, Feller p. 
172, Nicholson [12]. Note that in our case m will remain relatively large as a fraction of n. □ 

Fact 5.10. (Vapnik and Chervonenkis, [TT]) Let X be a set of events on which a probability measure 
Px is defined. Let S be a collection of random events, i.e. subsets of X, measurable w.r.t. Px- 
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Each sample xi, . . . xi and event A ^ S determines a relative frequency of A in this sample. Let 
P{A) be the probability of A and let vr^^) = sup{|i;J^ - P{A)\ -.AeS}. 

For each A & S and finite sample = xi, . . . Xr of elements of X, A is said to induce the subset 
o/{xi, . . .Xj.} consisting of those elements Xi which belong to A. The number of different subsamples 
of Xr induced by sets of S is denoted A'^(xi, . . . Xr)- Define m^{r) = max{A'^(j;i, . . . Xr)}, where 
the maximum is taken over all samples of size r. 

Then a sufficient condition for the relative frequencies of events in S to converge uniformly 
over S (in probability) to their corresponding probabilities, i.e. for it to be true that for any e, 
lim^_j.oo Prob(7r(^) > e) = 0, is that there exist a finite k such that m^{£) < £^ + 1 for all £. 

Remark 5.11. The connection between the condition of Vapnik-Chervonenkis and the independence 
property, defined in Remark \cl.l^ above, was observed and developed by Laskowski 

Fact 5.12. (Rate of the almost sure convergence) 

(1) p. 272) Given k from the last paragraph of Fact [5. 1 (A if i satisfies 



then in any sample of size at least i, with probability at least (1 — rj), the relative frequencies 
differ from their corresponding probabilities by an amount less than simultaneously over 
the entire class of events. 
(2) Bounds on the error of the normal approximation to the hypergeometric distribution may be 
found in Nicholson [12] p. J^IJ^ Theorem 2. 

Claim 5.13. (Random partitions of excellent sets) 

(1) For every e, C there is Ni such that for all n > Ni = Ni{eX), if A C G, \A\ = n, A is 
e-good, n > m > loglog(n), if we randomly choose an m-element subset A' from A then 
almost surely A' is (e + Q-good. Moreover, we have that h ^G =^ t{b,A') = t{b,A). 

(lA) That is, in part (1), for each ^ € (0, 1) there is N2 = N2{€,C,(,) such that the probability of 
failure is < 

(2) Similarly for "excellent" replacing "good". 

(3) In particular, for all e' > e and r > 1 there exists N = N{e,e',r) such that if \A\ = n > N, 
r divides n and A is e-excellent, there exists a partition of A into r disjoint pieces of equal 
size each of which is e' -excellent. Note that N{e,e',r) increases with r. 

Proof. (1) Call B C A an exceptional set if there is b £ G such that B = {a € A : aRb ^ t(b, A)} 
and \B\ > em. It suffices to show that almost surely A' satisfies: for all exceptional sets B 



By Fact 15.91 for n, m sufficiently large, we may approximate drawing a set of size m by the sum 
of m independent, identically and normally distributed random variables, where the probability of 
X G -B is just Since G has the non-Zc,,, -order property. Claim [23] in the case where G = A, 

A = A' shows that the Vapnik-Chervonenkis sufficient conditions (Fact I5.10p are satisfied. (Recall 
the definition of exceptional set from the first line of the proof.) 
(lA) By Fact [521 and Fact KW 

(2) Follows by the "moreover" in the previous clause. 

(3) Let e be as given, ( = e' — e, and ^ = Let us verify that N = N2{e,C,,^) suffices. First, 
randomly choose a function h : A ^ {0, . . . r — 1} such that for all s < r, |{a G A : h{a) = s}\ = 
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Then each s < r induces a random choice of a subset of A, since for each s < r we have h ^{s) & [A]r . 
Since h was random, for each given s, each B £ [A]^ is equally probable. By part (1), for each 
s <t 



!-(,< Prob{/i"^(s) is (e + C)-excellent} 



and therefore 



1 - < Prob{/\ h ^(s) is (e + C)-excellent} 

s<r 

But since 1 — = 1 — > 0, there exists an h which works, i.e. an h such that for each s < t, 
h''^{s) is (e + C)-excellent. Since € + ( = €', this finishes the proof. □ 



The next claim forms the core of the proof of Theorem 15.181 The statement is laid out so as to 
make the strategy of construction clear (based on the claims established so far). A less transparent, 
but more compact, list of the requirements in this claim is summarized in Corollary 15.151 For the 
Theorem, it remains to construct an appropriate sequence (rrij : i < A:**) which respects the various 
bounds collected here, and to show that this can be done while keeping m** sufficiently large relative 
to \A\. 



Claim 5.14. Assume that e < e' < 2 Suppose that A C G, \A 



n. 



(1) Let {rrii : i < /c*^,) be a sequence of natural numbers such that rrii^i < errii for i < k^^, and 
let := uiq, m^t, := m^^^^i > k^^. Then there is A such that: 

(a) A= {Ai:i< for some j(*) < ^ 

(b) For each i, Ai Q A and \ Ai\ S {mi : £ < k^^} 



i +3 



Ai r\A 



(d) each Ai is e-excellent 

(e) hence if i 7^ j < then the pair {Ai,Aj) is {e,e)-uniform 

(f) B := A \ \J{Ai : i < i{*)} has < m* members 



(as in Claim [^.IcSp 



(lA) Suppose further that: 

(i) m**|mA; for each k < 

(ii) mk„-2> N = N{e,e', 

(iii) log log < m^,* 

Then for some i{*) with < i{*) < there is a further refinement of the partition 
from (1) into i{*) disjoint pieces (in slight abuse of notation we will now use {Ai : i < i{*)) 
to refer to this new partition) such that for each i < \Ai\ = m,,*. Furthermore, each of 
these new pieces Ai is e' -excellent. 

(2) Let {Ai : i < i{*)) be the partition into equally sized e' -excellent pieces from (lA). Then 
there exists a partition {Bi : i < i{*)) of the remainder B, allowing Bi = % for some i [i.e. 

may be 0) such that 



\B\ 



\Bi\ G 



\B\ 



+ 1 



Let A'i = AiU Bi for i <i{*). Then: 

(a) {A'- : i < i{*)) is a partition of A 

(b) the sizes of the A'- are almost equal, i.e. 



\A'\ 



< 1 
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(c) if we let 

then i ^ j < implies {A'-,A'j) is -uniform. 



i{*) 



(3) //, moreover, m^,* > ^ and m^, < , then C < 3e' , where C is as in (2)(c). 

Proof. (1) Applying Claim [5741 we try to choose a sequence of e-excellent sets Ai, each of size 
for some £ < k^^, by induction on i from Ci := A \ IJj<i ^j- We can continue as long as \Ci\ > m^. 
Note that condition (e) is immediate, for all pairs {Ai,Aj) without exceptions, by Claim [521 

(lA) By Claim IF. 13( 3). Note that in the application below, we will build all relevant sequences 
of ms to satisfy m=K* ~ e^**m* so that N = N[e,e' ,e~^**) can be computed, if desired, before the 
sequence is chosen. 

(2) Immediate: the partition remains equitable because the Ai all have size m=K*, and C, bounds 
the relative size of a "bad" subset of any given Ai . 

(3) Given the assumption of an equitable partition from (2)(b), it would suffice to show that for 
every i, \Bi\ < 2e'\Ai\, as then we would have 

e'\Ai\ + l^^l ^ e'\A^\ + 2e'\A^\ ^ 

We verify that the assumption on m=K is enough to give this bound. By definition, as the BiS arise 
from an equitable partition of the remainder B, \Bi\ < -^7^^ + !> where is the number of 
components from the partition (lA), by (2) above. Since the components Ai from (lA) all have 
size m„, and \B\ < m* - 1, we can bound i{*) by ^ > i{*) > > Thus 

I „ I , ^ — 1 1 \ ~ "i* A ~^ , — I f m^, — \\ f n — m^\~^ 

\Bi\ — \ < — — ^ — < (m* — 1) and so — — < 



We had assumed that m^, < ^j^^, and so: 

m^{l + e') < e'n+ 1 

— 1 < (n — m^)e 
m» - 1 ^ ^1 
n — m* ~ 



We had also assumed that \ < m**, i.e. ^— < e'. Since \Ai\ = m^,* (so ^"^'J 1 ^ = -l? 
conclude that 

\Bi\ — 1 1 , , , 
< + <e' + e' = 2e' 



we 



\Ai\ n — m^: m* 

which completes the proof. □ 

Corollary 5.15. To summarize the requirements of Claim [57771 , suppose that and therefore /c*^, 
are fixed in advance, G is a graph with the non-k^-order property, and that we are given: 

(1) ei, £3 G M such that < £3 < £2 := y < ei < 2~^" 

(2) A sequence of positive integers {m£ : i < A;^,*) such that: 

(a) m^+i < e^mi for each i < fc** 
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(b) m^,*|m^ for each i < k^^ 

(c) log log mo < m^,* 

(d) m** := mk,^-i > max(A;**, ^) 

(e) mfc,_2 > iV(e3,e2,^), from ClaimlUMV 
(3) A Q G, \A\ = n where n satisfies niQ < ^f^^ 

Then there exists i{*) < and a partition of A into disjoint pieces {Ai : i < such that: 

• for all i < j < i{*), \\Ai\ — \Aj\\ < 1 

• each Ai is ei-excellent 

• Each pair {Ai,Aj) is ei-uniform 

Proof. By Claim [5?T^ using e = €3, e' = €2 and 3e' = ei; note that the partition we obtain was 
called {A[:i< i{*)) in Claim EH □ 

Discussion 5.16. In practice, we are given e = ei, and then choose €3 to run the proof of Corollary 
\5.15i The role of the respective es appears in conditions (2)(a) and (2)(e) of this Corollary. On one 
hand, €3 determines the rate of decrease of the sequence of ms, thus the size ofm^^^, and ultimately 
the number of components in the partition: so one would usually want to choose €3 close to ei = e. 
On the other hand as €3 approaches ei, the lower bound on the size of the graph A may rise, via the 
N from (2)(e), which comes from Claim ed. 13^( 3). 

Before stating the main result of this section, Theorem I5.18|, we consider more explicitly the 
relation of e-uniformity to e-regularity. As the following calculation shows, r/-uniform pairs will be 
/9-regular when p (the parameter for a lower bound on the size of a subset chosen) is sufficiently 
large relative to r/ (the parameter for an upper bound on the number of non- uniform edges). As 
mentioned above, uniformity is somewhat more precise in our context for large enough graphs, as 
the densities of sufficiently large e-regular pairs will be near or 1. 

Claim 5.17. Suppose that e,C,^ £ (Oi ^'^'^ pair {A,B) is {e,C,)-uniform. By uniformity, 
there is a truth value t{A,B) € {0,1}. Let Z := {(a, 6) ^ {A x B) : aRb ^ t} and likewise let 
Z' := {{a,b) G {A' x B') : aRb ^ t}. Suppose also that A' C A, \A'\ > C\A\, B' C B, \B'\ > (,\B\, 
and < |. Then: 

(1) m<^ + ^ 

2 

In particular, if the pair [A, B) is CQ-uniform for eo < ^ ^^cn [A, B) is also e-regular. 

Proof Let A' , B' be given. For a £ A, let Wa = {b £ B : aRb ^ t{A,B)}, and let U = {a e A : 
\Wa\ > e\A\}. So \U\ < €\A\, and a eA\U =^ \Wa\ < C\B\. Since 



Z CU xBU |J{(a, b) £AxB :beWa,a^U} 
Z' CU xB'u |J{(o, b)eA' xB:beWa,a^U} 
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we can bound the cardinalities as follows: 



\Z\ < \U\ ■ \B\ + \A\ ■ max{|Wa| : a G 

\Z\ AM C\B\ 

< TTT + TTTT = e + C 



\AxB\ \A\ \B\ 
and likewise 

\Z\' \U\\B'\ + \A'\-ma^{\Wa\:aeU} 



\A' X B'\ \A'\\B'\ 

e\Am\+^m\B\ \A\\B\ ^ \A\\B\ ^ g(g + Q ^ e + C 

\A'\\B'\ ' \A\\B\ \A'\\B'\ e C 



by the assumption on the size of A' , B'. This completes the proof of (1) and (2). 

For the "in particular" clause, let d{X, Y) = be the usual edge density. We have shown 

that if t{A,B) = 1, d{A,B) > 1 - (e + C) while d{A',B') > 1 - and likewise if d{A,B) = 0, 
diA,B) < (e + C) while d{A',B') < Thus the difference in density \d{A,B) - d{A',B')\ is 

bounded by If {A,B) is (eo, eo)-uniform and e is such that \A'\ > e\A\, \B'\ > e\B\ where 

2 2 

Co ^ Y then the difference in densities is bounded by ^ = ^ > which completes the proof. □ 

We now give the main result of this section. Recall the definitions of non-A;,,-order property 
(Definition 12. 2p . k^^, (Definition 12. 9p . e-excellent (Definition 15. 2p . and e-uniform (Claim [521) • 

Theorem 5.18. Let and therefore /c** be given. Let G be a finite graph with the non-k^-order 
property. Then for any e > there exists m = m(e) such that for all sufficiently large A G, there 
is a partition {Ai : i < i{*) < m) of A into at most m pieces, where: 

(1) for all i,j < i{*), \\A, \ - \Aj\\ < 1 

(2) each of the pieces Ai is e-excellent 

(3) all of the pairs {Ai,Aj) are {e, e) -uniform 

(4) ife<^, thenm<{3 + e){lf" 

Proof. Without loss of generality, assume e < 2izr • (This is necessary for Claim 15.151 which uses 
Claim [521) 

We proceed in stages. Let n = |^|. When hypotheses are made about the minimum size of n, 
these will be labeled (Hx) and collected in Step 5. 

Step 0: Fixing epsilons. When applying Corollary |5.15l we will use: £3 = |, e2 = |, and ei = e. 

Step 1: Fixing q. Given e^, let q = ^ G N. It follows that ^ > q > and thus ^ — | — ^3- 
In particular, any sequence {mi : £ < k^^,) such that m^,* := rrik^^^i € N and = (/m^+i for all 
(■ < kt,^ will satisfy m^j^i = ^m£ < e^nii, mi E N for each (. < A;^,*, and m**|m^ for all £ < k^^. 

Step 2: Choosing m^^. In this step, the aim is to build a sequence {mi : £ < fc^,*) whose elements are 
as large as possible subject to the constraints (2)(a),(b),(d) and (3) of Corollarv 15.151 In keeping 
with prior notation, let m* := mg. Recalling £2 = 5 from Step 0, Condition 15. 15lf 3) asks that 

f n + 1 In en 

m* < — so it sufhces to choose m* < — 



l+§ 1+i 3+e 
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Let (HI) be the assumption that n is not too small (see Step 5). Then there exists c G N, c > A;** 
such that 

J, f en u , en 

q^" Ice (^-—-q^" 

V3 + e 3 + e 

Thus setting m^,* := max{c G N : c > A;**, c > ^, q^"~^c < j^} is well defined, and m** 
will belong to the half-open interval just given. Having defined m**, for each £ < let mi := 
gk,,-i-i^^^^ By Step 1, the are integer valued and satisfy the required conditions on divisibility 
and size. By choice of c, m^, = q^"~^m^^ satisfies the inequality 15.15^ 3). 

We have defined a sequence (m^ : I < fc^.^,) of positive integers which satisfies conditions (2)(a),(b),(d) 
and (3) of Corollary 15.151 We fix this sequence for the remainder of the proof, and proceed to cal- 
culate various bounds in terms of it. 

Step 3: Bounding m**. By the definition of m,,* in Step 2, — q^**-'^ < (;'^**~^m,=#, so assuming 
n is not too small [again (HI) in Step 5], 

_ 1 < ^ I' ^ 

Step 4-' Bounding Applying Step 3, an inequality from Step 1, and the definition of es, 



_ri_ ^ n ^ 2(3 + 6)g^"-i ^ 2(3 + 6) / 2_y**-' = (3 + e) f-^ 

<(3 + .)(f' " 



Note that a choice of 63 closer to 62 would slightly improve this bound, at the cost of increasing the 
threshold size of n in (H3) of Step 5. 

Step 5: Requirements for the lower bound on n = \A\. We collect the necessary hypotheses on the 
size of the graph: 

(HI) n is large enough to allow for the choice of m* in the interval from Step 2 while preserving 

it suffices that n > {k^^, + l)g^"-i (^), which ensures ^ — q'^**-^ > k^.^q'^**~^ 

and also ensures that n > 2q^**~^ (^7^)' calculation in Step 3 

(H2) n is large enough for the sequence (m^ : I < k^^) to satisfy log log m* < m^^^: 

it suffices that n > (loglogg'^"~-'^)(g'^"~^) (^) 
(H3) n is large enough for mfc^^_2 to satisfy condition (2)(e) of Corollary 15. 15t 

it suffices that n > A^(|, |, g*^**~^) • (^) • q^**'"^ where iV(-,-,-) is from Claim [5TT3] and 

incorporates the bounds from Fact 15.121 

Under these assumptions the sequence constructed in Step 2 will also satisfy conditions (2)(c),(e) 
of Corollary 15.151 By Step and Step 2, all the hypotheses of that Corollary are satisfied. 
Step 6: Obtaining the partition. Assuming n is sufficiently large, as described in Step 5, we have 
constructed a sequence (m^ : £ < fc^^,) so that the graph A and the constructed sequence satisfy the 
hypotheses of Corollary 15. 151 Thus we obtain a partition of A satisfying (1),(2),(3) of the Theorem. 
Condition (4) follows from Step 4, which completes the proof. □ 



30 



M. MALLIARIS AND S. SHELAH 



Corollary 5.19. Let G be a graph with the non-k^-order property. For every e € (0, 2) there are 
N,k as in Theorem \5.18\ such that if A G, \A\ > N, then for some m < k, there is a partition 
A = {Ai : i < m) such that each Ai is e-excellent, and for every < i < j < m, 

• \\Ai\ - \Aj\\ < 1 

• {Ai,Aj) is e-regular and 

• ifBi e andBj G [Aj]^'^^^^ then 

[d{B^, Bj) < e) V [d{B,, S,) > 1 - e) 

Proof. This is a slight weakening of Theorem 15.181 which also replaces "e-uniform," as defined in 
Claim [5^61 by the more familiar e-regular via Claim IF. 17i For e-excellent, see Definition 15.21 □ 
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